A Generic Data Harmonization Process for Cross-linked Research and Network Interaction

نویسندگان

  • D. Firnkorn
  • M. Ganzinger
  • T. Muley
  • M. Thomas
  • P. Knaup
چکیده

Objective: Joint data analysis is a key requirement in medical research networks. Data are available in heterogeneous formats at each network partner and their harmoni zation is often rather complex. The objective of our paper is to provide a generic approach for the harmonization process in research networks. We applied the process when harmonizing data from three sites for the Lung Cancer Phenotype Database within the German Center for Lung Research. Methods: We developed a spreadsheet-based solution as tool to support the harmonization process for lung cancer data and a data integration procedure based on Talend Open Studio. Results: The harmonization process consists of eight steps describing a systematic approach for defining and reviewing source data elements and standardizing common data elements. The steps for defining common data elements and harmonizing them with local data definitions are repeated until consensus is reached. Application of this process for building the phenotype database led to a common basic data set on lung cancer with 285 structured parameters. The Lung Cancer Phenotype Database was realized as an i2b2 research data warehouse. Conclusion: Data harmonization is a chal lenging task requiring informatics skills as well as domain knowledge. Our approach facilitates data harmonization by providing guidance through a uniform process that can be applied in a wide range of projects.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Generic Data Harmonization Process for Cross-linked Research and Network Interaction. Construction and Application for the Lung Cancer Phenotype Database of the German Center for Lung Research.

OBJECTIVE Joint data analysis is a key requirement in medical research networks. Data are available in heterogeneous formats at each network partner and their harmonization is often rather complex. The objective of our paper is to provide a generic approach for the harmonization process in research networks. We applied the process when harmonizing data from three sites for the Lung Cancer Pheno...

متن کامل

Optimal Harmonization of Out-Network Traffic Control Regulations in Social Networks

Regulations of use of social networks, as one of the key components in these networks, serve an important role in controlling the flow of traffic. The study of the harmonization of these terms and regulations can be a significant step to avoid congestion and (Users’) rejection in the network. Harmonization of traffic control regulations (TCR) among social networks is one of the best solutions t...

متن کامل

Adsorption Mechanism for Aniline on the Hypercross-Linked Fiber

A type of novel hypercross-linked fiber adsorbent was obtained by sulfonation and cross-linking reaction of polypropylene fiber grafted styrene-divinylbenzene. The aim of the fiber sulfonation and cross-linking method was to prepare rigid three dimensional networks in the entire fiber and change the ion exchange capacity of fiber. The hypercross-linked fiber adsorbent possesses a principall...

متن کامل

Maelstrom Research guidelines for rigorous retrospective data harmonization

Background It is widely accepted and acknowledged that data harmonization is crucial: in its absence, the co-analysis of major tranches of high quality extant data is liable to inefficiency or error. However, despite its widespread practice, no formalized/systematic guidelines exist to ensure high quality retrospective data harmonization. Methods To better understand real-world harmonization ...

متن کامل

Interval Efficiency Assessment in Network Structure Based on Cross –Efficiency

As we know, in evaluating of DMUs some of them might be efficient, so ranking of them have a high significant. One of the ranking methods is cross-efficiency. Cross efficiency evaluation in data envelopment analysis (DEA) is a commonly used skill for ranking decision making units (DMUs). Since, many studies ignore the intra-organizational communication and consider DMUs as a black box. For sign...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017